CDS

Accession Number TCMCG036C20097
gbkey CDS
Protein Id PTQ30856.1
Location join(541822..543171,544990..545160,545918..546215,546521..546636,546837..547120,547388..547454,547716..547949,548137..548280,548725..548850,549148..549336,549587..549706,549889..550107)
GeneID Phytozome:Mapoly0119s0054
Organism Marchantia polymorpha
locus_tag MARPO_0119s0054

Protein

Length 1105aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA53523, BioSample:SAMN00769973
db_source KZ772791.1
Definition hypothetical protein MARPO_0119s0054 [Marchantia polymorpha]
Locus_tag MARPO_0119s0054

EGGNOG-MAPPER Annotation

COG_category O
Description Ubiquitin-activating enzyme e1 C-terminal domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko04121        [VIEW IN KEGG]
ko04147        [VIEW IN KEGG]
KEGG_ko ko:K03178        [VIEW IN KEGG]
EC 6.2.1.45        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04120        [VIEW IN KEGG]
ko05012        [VIEW IN KEGG]
map04120        [VIEW IN KEGG]
map05012        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGTGACGATGGATGTAGATGGGGCGAATAATTCCTTGAGCCCTGCGAAAGAAATTGACGATCTCAGATACTCTCGGTTGATTTACACTCTGGGACGAGAGGCAGTGCAGGCCATATATCAGAGCAAGGTATTAATCCTCGGTTGCAACGGCGTGGGAGCAGAAATTGCGAAGAACCTGGTGCTGTCTGGTGTGCGAGGTTTGGGCTTGGTCGACGATGAAGCTGTGTCTATTGGTGATTTGTCCGCGCAATTTCTTCTCACGGAAGCAGACATTGGTCGGAACCGAGCGGGTGCATCTGCTGCAAAGCTTAAAGAAATGAATCCCACAGCAGAAATTACACTGATACCAGTCTTACAGTTGGAAAGCTGCTTGAGTTCTTACCAGGTGTTTGTGGCAACAACAGGTAACATGCCCTATTTGATGGAGATAAATCTACTCTGTCGGTCACTTGGAGTTCCTTTCATTCTGGCTACGTCTCGTGGACTGTTTTCTCAGGTATTCGCAGATTTTGGGGACAACTTCTACGTTGTGGATGAAACTGGAGAGCCTTCAGGGGCCATCCTTGTGGAGAGCATCACCCAGGACTTTCCCGCTACGGTCACTGTGGTAGAGGAACAGAGGCATGGTTTAGAGGATGGCGATCAGGTAGTACTAAGGGGAATTAAAGGTATGGAAGAGCTCAACCGTGATGCTTCCTTCACCGTTGCCGGGACTGGAACCAGTTCTTTCACTATACCCGAAGACACTCGAAATTTTGGCCGATACCTATCAGGCGGATATTTTCATAAAGTCAAGCAAGAAAAAGTCGTGCAGTTTCTCTCATTGGAAGGATCCCTGCACTCGCCGAAATTTGGTTTGAGCGACCCTGCAAAAGTAGCACGGAATCCCCATTTACATGTTGCCTTTCAGGCAATCAGCGAATTTGAAAGAAGGAAGAGTGGAGGAGCATCTATCGCCACAGGTGCACGCTTGGTCAAAGAGGAGGATCACGGCGAAATACTGGAGTATGCCAGAGAGATTTGGGAGCAGTTAGGGTTTCATGAACGAGTCAGGGAATCCAGTAATGGAAGCTTTGGAATGGAAGAAATTGCGGAAGTAGTTGGAGACGGTAGAGGTATAATTGACAGTGTCGACATTCATCCTTCAAGCGGAGTCGCCGGGGTTGGGTGCAGTAGCATTGATTTGGGAGGAATTGCGGAAGCTGGGAGCTCGAAGGAAAGCAAAGCCAGCAGGAAAGTTGACACCCTCGATGAAGAACTTGTGAAGCTGTTGGCTCGAGGCGCCCATGTTGAATTGAGTCCTATTGCTGCGATAACTGGGGGAATCGCTGCCCAGGAAGTCATGAAGGCTATCTCAGGAGTCTTCACGCCACTGAGTCAGTGGCTGTATTTCGATGCGTTGGAATGCCTACCGTCAGTCGCGCCACCTCCAGAGGAGACGATAGCAAGTGGTTCTCGTTACGACCTGCAGACAGCACTATTTGGCAGAAAGTTTCAAGAGCTGCTTGGTAGTCTTCAGTGGTTGGTGGTAGGTGCAGGTGGTCTCGGTTCAGAGGTGTTGAAAAATTTGGTGATGATGGGTGTTGGCTGTAATCCTGATGGTAATATTACGGTTACAGACATGGACCGAGTATCGAGGCCGAATCTCGTTGATCAGCTTCTATATCAGATCGACGATCTAGATCGACCCAAGACTCCTGCAGCAGCAAGAGCCTTGAGGAACATAAACCCTGCTGCTCAGATCCATGCTTTGCAAGAAAAATTTGATACAGATACAGAGGGGATTTTTGACTCTTCGTTTTTCAAGTCTGTCGCAGGAGTTTTCTCTGCTGTGGATAGCGCGCAAGCCAGATTGTATATTGACAACAGATGTGTTACACATCGTAAGCCTATGATCGATGGAGGCAAGCATGGCACCAAAGGAAGTGTACAGGTGTTCGTTCCATTTCAATCAGAAATGTATGCCTCTAGTAATGATCCACCGGAGCACAAAGACATGCCTATTTGCACTTTGAAGAACTTTCCCTATTCAGCAGAGCATACACTACAATGGGCAGTAGAGACCTTTGAGACGTTGTTCAAGAAGCGACCCCTGGATGTCAACGCATATTTGTCAAACCGTGATTTCCAAGATTCGATTCGGAAGTCTCCTCCAACATCCCGTCTCCCCATACTAGAAACTTTGCGAGATGCTCTTCTACGACATAGACCTCTCAGTTTCGAAGCTTGTGTGCAATGGGCACGGTTGCAGTTTGAAGAACTTTTCGTCAATATCATTAAGCAGCTTTGTTTTACATTTCCACCAGGCATGACAACCACTGCCGGAGCACCTTTCTGGAGCGGGACCAAGAGAGCTCCGGCACCCCTTACGTTTGATCCCTTAAATCCGTTGCATCTGGAATTCATTGTTGCTGCCGCCAACCTTCAAGCAACTGTCTACGGGTTGAAGGGTTGTCAGGAGCATGCCGTCTTCCTTGATATCCTTCAGAATGTGGAAGTTCCAGCCTTTGAACCCAAAGAAGGTGTAAAGATTGCAGTTTCGGATAGTGAGTATCGGAATATGGGTAGCCAAAGGGGCATGCGGCCCGGCTCAGAGGATAGTGCTGCTGTGGAAGCATGCGAAGCTTTACTTCAGGAGCTCCCCACTCCTGCCACACTTGCGGGTTATCGGCTTACACCCATTGACTTCCAAAAGGATGATGAACGGAATTTCCACGCTGAATTCGTTGCCACAGCAGCCAGTCTACGAGCTTGCAACTATGGCATCCCAGTCAGTGACAAACTACAGGCAAGATTTGTGGGTGGAAAAATCATCCCGGCAATCATCACATCAACTGCAATGGTTGGAGGTCTCATGTGTCTAGAACTGTACAAAATACTACTTCAAAAGCCTCTGACAGACTATAAGCATGCGTACTTCAATCTCGCGGTTCCTCTATTTACATTTGCTCAGCCCATCAGAGCTGTACAGAATACGGTCGCAAGGCGTCAAGATACTCCGCTAACATGGACGCTATGGGACAGATTTGAAATGGAATGTGTTGGTATGACATTAGAGGCATTTTTGGCAGAGTTCAAGCGCCAACAAGGACTTGAGATCACTATGCTCTCCTTCGGAAAAAGTCTCCTGTACGCGGAGTTTCTTCCCCGCAAAAAGTTGCAGGACAGACTACCTCTCCCATTGTTAGAACTCATCACGGTGATTGGGAAAGTAACTATCCCTGCTACTGAGAGCAGAATCATTTTTTCAATCTCCTGCACCGATGCAAACGACGATGATGTCGAAGTACCTGATGTTGTCGCCCGTGTTCGCTGA
Protein:  
MVTMDVDGANNSLSPAKEIDDLRYSRLIYTLGREAVQAIYQSKVLILGCNGVGAEIAKNLVLSGVRGLGLVDDEAVSIGDLSAQFLLTEADIGRNRAGASAAKLKEMNPTAEITLIPVLQLESCLSSYQVFVATTGNMPYLMEINLLCRSLGVPFILATSRGLFSQVFADFGDNFYVVDETGEPSGAILVESITQDFPATVTVVEEQRHGLEDGDQVVLRGIKGMEELNRDASFTVAGTGTSSFTIPEDTRNFGRYLSGGYFHKVKQEKVVQFLSLEGSLHSPKFGLSDPAKVARNPHLHVAFQAISEFERRKSGGASIATGARLVKEEDHGEILEYAREIWEQLGFHERVRESSNGSFGMEEIAEVVGDGRGIIDSVDIHPSSGVAGVGCSSIDLGGIAEAGSSKESKASRKVDTLDEELVKLLARGAHVELSPIAAITGGIAAQEVMKAISGVFTPLSQWLYFDALECLPSVAPPPEETIASGSRYDLQTALFGRKFQELLGSLQWLVVGAGGLGSEVLKNLVMMGVGCNPDGNITVTDMDRVSRPNLVDQLLYQIDDLDRPKTPAAARALRNINPAAQIHALQEKFDTDTEGIFDSSFFKSVAGVFSAVDSAQARLYIDNRCVTHRKPMIDGGKHGTKGSVQVFVPFQSEMYASSNDPPEHKDMPICTLKNFPYSAEHTLQWAVETFETLFKKRPLDVNAYLSNRDFQDSIRKSPPTSRLPILETLRDALLRHRPLSFEACVQWARLQFEELFVNIIKQLCFTFPPGMTTTAGAPFWSGTKRAPAPLTFDPLNPLHLEFIVAAANLQATVYGLKGCQEHAVFLDILQNVEVPAFEPKEGVKIAVSDSEYRNMGSQRGMRPGSEDSAAVEACEALLQELPTPATLAGYRLTPIDFQKDDERNFHAEFVATAASLRACNYGIPVSDKLQARFVGGKIIPAIITSTAMVGGLMCLELYKILLQKPLTDYKHAYFNLAVPLFTFAQPIRAVQNTVARRQDTPLTWTLWDRFEMECVGMTLEAFLAEFKRQQGLEITMLSFGKSLLYAEFLPRKKLQDRLPLPLLELITVIGKVTIPATESRIIFSISCTDANDDDVEVPDVVARVR